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ABSTRACT 

We present the first multi-parameter analysis of the spatial clustering properties of nearby galaxies 
in which accretion activity is present to different degrees. We spectroscopically identify and classify 
active galactic nuclei (AGN) from the Sloan Digital Sky Survey (SDSS) DR2 main galaxy sample, 
which yields the most precise measurement to date of AGN clustering. Estimates of the redshift space 
two-point correlation function (CF) reveal that Seyferts are clearly less clustered than normal galaxies, 
while the clustering amplitude of LINERs is consistent with that of the parent galaxy population. 
The similarities of the distributions of host properties (color and concentration index) of Seyferts and 
LINERs suggest that the difference in their clustering amplitudes is not driven by the morphology- 
density relation. We find that the luminosity of the [O I] emission shows the strongest influence 
on AGN clustering, with low i[oi] sources having the highest clustering amplitude, sq- This trend 
is much stronger than the previously detected dependence on i[oiii]i which we confirm. The fact 
that objects of given spectral types are clustered differently seems correlated with a variety of their 
physical properties including i[oi] : ^[oiii]j the electron density of the emitting gas Ue, and the 
obscuration level. LINERs, which exhibit high sqi show the lowest luminosities and obscuration 
levels, and relatively low ne, suggesting that these objects harbor black holes that are relatively 
massive yet weakly active or inefficient in their accretion, probably due to the insufficiency of their 
fuel supply. Seyferts, which have low sqj are very luminous in both [O I] and [O III] and show large 
rie , suggesting that in these systems the black holes are less massive but accrete quickly and efficiently 
enough to clearly dominate the ionization. Star-forming galaxies - the H ii's ~ are weakly clustered; 
because these systems are hosted mainly by blue, late type galaxies, this trend can be understood as a 
consequence of both the morphology-density and star formation rate-density relations. However, the 
spectral properties of the H ii's suggest that these systems hide in their centers, amidst large amounts 
of obscuring material, black holes of generally low mass whose activity remains relatively feeble. Our 
own Milky Way may be a typical such case. 

Subject headings: galaxies: active — galaxies:emission lines — galaxies: clustering — galaxies:statistics 



1. INTRODUCTION 

Clustering analysis can be a useful tool to employ in 
investigating the formation and evolution mechanisms of 
various astrophysical objects. In the case of AGN, the 
pattern of spatial clustering and its implications for the 
environments of AGN provides constraints on galaxy evo- 
lution models, and in particular on how well AGN trace 
the normal, non-active galaxy distribution. Comparisons 
of the AGN environments with those of control samples 
of regular galaxies may indicate differences that reveal 
what determines the activity in galaxy nuclei, and even 
gauge its strength and time cycle, thus yielding a phe- 
nomenological link between black hole growth and galaxy 
formation and evolution processes. 

Estimates of the two-point correlation function (CF) 
provide important diagnostics for the black hole (BH) 
masses and duty cycle of fueling in AGN. If struc- 
ture formation is hierarchical, then the rarest and most 
massive virializ ed dark matter (D M) halos cluster the 
most strongly l)e.g.. Kaiser 1 1198-^ . thus the clustering 
strength of a given object type constrains the mass 
and number density of the halos in which they re- 
side. The DM halo mass can further be connected 
to the physical properties of the resident galaxies and 
their central black holes through empirical scaling re- 
lations: the black hole mass ~ stellar velocity disper- 
sion Mbh — cr* relation that holds for normal galax- 



ies (IGehhardt et al. lIM^ IFerrarese MTTTitTI l2nf)(^ 

as well as for t he active ones ("Gebhardt et al. 20001i 

I Ferrarese et al 2001 : Nelson et al 2004: ,Onken et alJ 

120041 iGreene fc Ho I l2005|) thecorrelations between 
bulge prope rties and Mbh l)e.g.. Magorrian et al. 11199^ 
iGraham et al. 200^, and the correlation b etween cir- 
cular velocity and central velocity dispersion ('Ferrares'el 
12002: Baes et al. 2003) that links dark matter halos 
and supermassive BHs. The number density of the 
DM halos can be used to infer the duty-cycle, or the 
accretion life-time, as this is given by the inverse ra- 
tio of this number density to that of the observed ob- 
jects (Martini & Weinberg 2001). Such predictions for 
the AGN's lifetime, an d its implications for the effi- 
ciency of BH formation l)Haiman fc Hui l|2001^ . can test 
the fundamental assumptions of semi-analytical models 
of AGN and galaxy formation f Kauffmann k, Haehnelt I 
11)00; Di Matteo et al. 2003). 

Studying the spatial clustering properties of differ- 
ent spectroscopically-classified AGN species may link 
their ionization mechanisms to properties of their hosts 
and environment. Through comparison of the clus- 
tering amplitudes of accreting-type sources such as 
Seyferts and the starlight-ionized H ii galaxies, it is 
possible to contrast properties of the dark matter ha- 
los that host different kinds and levels of galactic nu- 
clear activity. Particularly interesting is the cluster- 
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ing of the ubiquitous Low Ionization Emission-Line Re- 
gions (LINERs, e.g. Heckman 1980), whose physical 
origin, whether thermal or non-thermal, remains am- 
biguous despite exhaustive analysis of their properties 
jHo "1999; Ho. FiUoDcnko & Sareent "2003; Filho et al. I 
12004: .Maoz et al. 2005 : Constantin et al. 20061 . 

Galaxy mergers are predict ed to play a crucial role in 
trigg ering AG N activity lle.g.. Barnes &: Hernauist I 

1992 IKauffmann fc Haehnelt [ I2OO0F 

Pi Matteo. Soringel fc H ernauist I l2005j) . If galaxy 
nuclei become more AGN-like when major mergers 
occur, then active galaxies of different spectral signa- 
tures, corresponding to different degrees of contributions 
from a non-thermal ionization process, may cluster 
differently, reflecting variation in their environments. 
Thus, a comparison of the clustering properties of the 
active and non-active galaxies is an important tool for 
testing the merger-accretion connection. 

Through its wealt h of both spectrosc opic and photo- 
metric data, SDSS ijYork et al. 1 1200 0» is ideal for in- 
vestigating the dependence of clustering properties on 
a variety of emission characteristics. The SDSS obser- 
vations offer the unique opportunity to examine simul- 
taneously the dependence of environment on the spec- 
tral classification of emission-line galaxies, and on vari- 
ous other features that characterize or even determine a 
detectable accretion process, in conjunction with proper- 
ties o f their host galaxies. Pre vious stud ies with similar 
aims llKauffmann et al. II2(M |2004; Mill er et al. 1l2?ffl^ 
iWake et al. 11200^ compared passive galaxies with AGN 
as a general class and did not investigate the properties 
of sub-populations such as Seyferts and LINERs sepa- 
rately. In these studies, the optical classification of an 
object as an AGN relies, especially among the narrow- 
line emitters, mainly on the [N II] /Ha vs. [O III]/H/3 
diagnostic diagram. There are suspicions that such 
rather lenient definitions may n ot best represent t he ac- 
cretion dominated population l)Hao et al. Il2005fl . Us- 
ing such definitions , the AGN popul a tion i s found by 
iMiller et al."l (|2003l) and iWake et al.1 l|2004D to occupy 
a uniform fraction of galaxies, to be common to a large 
range of galactic environments, and to follow the dis- 
tribution of the whole galaxy population, and therefore 
to be unbiased tracers of mass in the universe. How- 
ever, when divided by their ([O III]) luminosities, the 
activ e nuclei show mark edly different clustering proper- 
ties ijWake et al. 1 12004'). d ifferent star-form ation histo- 
ries in their hosts IjKauffman n el^ ^1. | l2fl03D. and differ - 
ent environmental preference ijKauffmann et al. 1 1200^ . 
Combined together, these findings seem to suggest that 
Seyferts and LINE Rs, that represent quite distinct dis- 
tributions in i[oiii] (Kauffm ann et al. 120 031. might also 
cluster differently. Such inferences are intriguing and 
thus merit further clarification. It is thus crucial to estab- 
lish and quantify the connections between the intrinsic 
physical properties of the nuclear line-emitting regions 
and those of their host galaxies and dark matter halos. 
We pursue these issues here by including important spec- 
tral diagnostics that probe the astrophysics of AGN. 

In this paper, we use SDSS DR2 data in an attempt 
to detect and analyse differences in the clustering prop- 
erties of (1) active and non-active galaxies, (2) objects 
of distinct types of activity, i.e., accretion versus star- 
formation, and (3) different spectroscopic types of AGN, 



e.g., Seyferts versus LINERs. In order to carry out such 
analysis with a good understanding of the possible bi- 
ases introduced by detection and spectral identification, 
we present in Section [21 a systematic examination of the 
different AGN selection criteria and their sensitivity to 
the dominance of an accreting-type nuclear power source. 
In Section |51we estimate and compare the redshift-space 
CF for subsamples of objects of different spectral classes 
(Section 13.1.1(1 . and investigate the degree to which these 
measurements are influenced by the level of nuclear activ- 
ity, as traced by the [O III] and [O I] luminosities (Section 
I3.1.2|l . properties of the line-emitting gas, e.g., intrinsic 
obscuration, as traced by the neutral hydrogen column 
densities Nh and the electron density rig (Section I3.1.3|l , 
and host properties (Section 13. 2() . We discuss in Section 
01 possible implications of our findings on the features 
that distinguish Seyferts from LINERs, on the peculiar 
nature of the Transition objects, that border the defini- 
tion of AGN (Seyferts or LINERs) and H 11, and on the 
potentially great similarity between the Galactic Center 
and a typical H 11 system. Section [31 summarizes the 
findings and conclusions of this work. 

2. THE DATA 

This study is based on data from the Sloan D igital Sky 
Survey (SDSS) Data Release 2 CDR2: Ab azaiian ct al.J 
120041) . The parent sample from which wc draw various 
categories of sources is a subset of the main galaxy sam- 
ple u sed for large-scale structure studies ( Strauss et al. I 
l2002t) . and includes objects with Petrosian r magnitudes 
14.5 < rrir < YI .17 afteji correction for G alactic ex- 
tincti on based on maps of jSchlegel. Finkbcinc r fc Davis I 
Il998(l . and with a redshift distribution that extends from 
~ 0.005 to ~ 0.30, with a median of z « 0.1. Thus, 
the sample does not include low redshift quasars or 
broad emission- li ne, type 1 A GN (Richards ct al. 200^ 
iSchneider et al. 1120021 12003^ . To clarify the nomencla- 
ture, what we define as Seyferts and LINERs are only 
type 2 objects, where no obvious broad component is de- 
tected, thereby leaving the narrow lines as the prominent 
spectral features. 

The core of the analysis we present here exploits the 
source spectroscopic properties and, consequently, the 
source spectral type definition. The spectroscopic mea- 
surements we employ are drawn from a catalog of ab- 
sorption and emission line fluxes and equivalent widths 
(EW) of a superset of the DR2 galaxy targets which ful- 
fill the galaxy sample selection criteria. These data are 
kindly made publicly available by the MPA/JHU collab- 
oration at http: //www.mpa-garching.mpg.de/SDSS/| 
(RBrinchmann ct al. 2004), and arc analyzed by, e.g., 
Kauff mann et al. I lj2003t l200l . This dataset has the 
advantage that the line emission component is sepa- 
rated from the host galaxy light; based on stellar pop- 
ulation synthesis templates that span a relatively large 
range in age and metallicity, the absorption-line contri- 
bution is ide ntified and subtracted f rom the total galaxy 
spectrum l|Tremonti et al. I I2004D . We should note 
here that such model-based measurements exclude alpha- 
enhancement that might result in an enhanced "bluing" 
on the evolutionary tracks that translate in a possible 
enhancement in the Hl3 absorption index by up to 0.26A 
(|Thomas fc Maraston 2003') . Such bias remains however 
negligible for the strongly line emission systems that we 



3 



consider in this study, where we employ only > 2-signia 
line flux measurements (see Section 12.1.1(1 : in such sam- 
ples, the minimum strength (equivalent width) of the H/? 
emission is I.SA, hence, even an extreme "bluing" has a 
maximum effect of 17%, and should affect only < 10% of 
the sample. 

Other observed properties of the objects used in this 
study were measured using the standard SDSS photo- 
metric and spectroscopic pipelines and were obtained di- 
rectly from the SDSS archive. We employ Petrosian mag- 
nitudes, and in calculating the absolute magnitudes from 
the apparent magnitudes m,- and the redshift z we apply 
the formula 

Mr = mr-51ogio(s(l-f'z)/lMpc)-25-i^(z)-)-51ogio/i+Q(z 

(1) 

Here K(z) is the r-band magnitude galaxy iiT-correction 
relative to th e z = 0.1 value, as calculated by 
iBlanton et al. I l|2003bj) . and made publicly available as 
part of the NYU Value-Added Catalog (Blanton ct al. 
2005, http://wassup.physics.nyu.edu/vagc/), s(l + 
z) is the luminosity distance, with s the comoving dis- 
tance, and Q{z) accounts for the average evolution in 
galaxy luminosities in the rec ent pa st , with Q(z) = 
1.6(z - 0.1) fBlanton et aTl l2003at iTeemark et aTl 
120041) . Throughout this paper we assume a cosmology 
with 17 = 0.3, A = 0.7, and = lOO/i km s"! Mpc'^ 
The distances are quoted in hr^ Mpc. 

In this study we conduct a comparative analysis of the 
clustering properties of the emission-line galaxies by em- 
ploying subsamples of galaxies, defined by absolute mag- 
nitude limits. Figure Q] illustrates the distribution of the 
whole galaxy sample (247,080 objects) in the redshift- 
absolute magnitude space. The continuous curves delin- 
eate the sample boundaries corresponding to our choice 
of apparent magnitude limits 14.5 < < 17.7. The 
bright cutoff excludes bright galaxies that may have been 
shredded into many pieces, particularly in early SDSS 
photometry, and/or galaxies that have been excluded 
from spectroscopic target selection due to fiber satura- 
tion effects, while the fai nt limit corresponds to that of 
the main gal axy sample llStrauss et al. lIlOO^ . 

Following lParket al.1 l|2005D. we define a "Best" 
volume-limited sample of SDSS galaxies that con- 
tains the maximum number of galaxies, with absolute- 
magnitude limits —21.6 < < —20.2 that correspond 
to a redshift range of 0.05 < z < 0.12, or a comoving 
distance range of 148.5 < r < 350.1 Mpc, when the 
apparent magnitude cuts are applied. The deflnition of 
our "Best" volume is shown in Figure ^as a dashed line. 
The sky area covered by the parent spectroscopic sample 
is illustrated in Figure 1 of Abazajian et al. (2004). It is 
important to note that the subsets of spectroscopically- 
defined AGN drawn from this volume-limited galaxy 
sample will not themselves be truly volume-limited sam- 
ples because of selection effects induced by their spectro- 
scopic definition; the host galaxies are restricted to the 
described range of absolute magnitude and redshift, but 
the redshift distributions of AGN may not be uniform 
because of the requirement that the emission lines be de- 
tected. These effects are discussed in detail in Section 
12. 21 and are taken into account in our clustering analysis. 

The spectroscopic subsamples of interest for this study 
suffer from the fiber collision problem. When two targets 



lie closer than 55", only one member of such galaxy pair 
will have a spectrum available, unless later spectroscopic 
tiles overlap that same area of sky. Because classifica- 
tion as an AGN requires a spectrum for each object, we 
cannot make any "correction" for such collisions, as is 
commonly done for large-scale struc ture studies of SDSS 
galaxy samples (|Zehavi et al. ll200^ . The impact of this 
problem on our estimates of the correlation functions is 
discussed later in Section 13 

2.1. Subsample Definition 
2.1.1. Active vs. Non- Active: the Spectral Classification 

The fraction of galaxies with an AGN is difficult 
) . to assess observationally, because it strongly depends 
on the quality and quantity of the available spectral 
data. Sub-classifying emission-line nebulae typically 
requires a comparison of pairs of line-flux ratios of 
spectral features that are relatively insensitive to 
reddening and metallicity, are able to distinguish be- 
tween different ionization levels, and provide a useful 
segregation of different object types. There is a set 
of line ratios that satisfy these criteria, and that 
have been widely used in quantifying emission prop- 
erties and distinguishing between accretion-powered 
and starfight-powered systems: [O III]A5007/II/3, 
[N II1A6583/Ha [S II1AA6716.6731/Hq. and 
]p I1A6300/ Ha ie.g.. Baldwin. Phillips, fc Terlevichi 
[1981; VeiUcux fc Osterbrock II1987D . 

Figure |21 illustrates how galaxies that exhibit all 6 of 
these emission lines at moderately high signal-to-noise, 
e.g., with fractional errors <50%, segregate as a func- 
tion of the se combinations of line ratios. Following the 
Ho. Filipp enko fc Sargent I l(1997aD criteria, as fisted in 
Tabled we classify the objects as H ii galaxies, Seyferts, 
and LINERs, with their Pure LINER and Transition sub- 
sets. H ii's are the galaxies that populate mainly the 
left branches of these diagrams, as they usually show 
weak low-ionization transitions in [N II], [S II], and es- 
pecially [O I]. By contrast, Seyferts and LINERs occupy 
the right branch as they exhibit relatively strong for- 
bidden features. The distinction between Seyferts and 
LINERs is usually given by their ionization level, which 
is easily gauged by the [O III]/H/3 ratio, with Seyferts 
showing larger values. The Transition systems are dif- 
ferentiated from the Pure LINERs based on their low 
[O I] /Ha ratio. Such divisions offer a basis for categoriz- 
ing the line-emitters. However, as is apparent from these 
three heavily populated diagrams, the nebular proper- 
ties span a continuous range; no breaks appear in the 
diagrams to indicate a clear-cut transition between one 
type to another. 

There are also relatively successful theoretical at- 
tempts to separate the accretion-type galaxy nuclei from 
star-forming sources, using criteria that can be de- 
scribed using these same spectra diagnostics diagrams. 
Based on models that account for large realistic metal- 
licity, ionization par ameter, and dust depletion ranges, 
iKewlev et al. I l|2001fl find that simple hyperbolic curves 
can be used to split the line-emitting galaxies into AGN 
and star-forming galaxies. As can be seen from Figure 
13 the curves claimed as upper limits on the location of 
H ii-like objects exclude only a small fraction of LIN- 
ERs from the AGN class, and are otherwise reasonably 
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consistent with the iHo. FiUppenko fc SargenTI l)1997a[l 

classification criteria. 

Previous studies of low luminosity narrow-lined AGN 
in SDSS samples used simpler criteria than we use 
here. Based on the shape of the distribution of ob- 
iects in the [N II] /Ha vs. [O III]/H/3 diagram, 
iKauffmann et al.1 <)2003(1 propose an empirical cut be- 
tween AGN and H II - type galaxies, wh ile some other 
studies (iMiUer et al.1 l200a IWake et a l. 2004) con- 
sider AGN classification schemes based only on values 
of the [N II] /Ha flux ratio (i.e., with the condition that 
[N II]/Ha > 0.6). These classifications overlook the 
emission-line behavi or as a function of [S Ilj/Ha. and 
[O I]/Ha, which. e.g..lVeilleux fc Osterbrock I l|1987j) and 
iHo. Filippenko fc Sargent I l)1997aj) argue that discrimi- 
nate better than [N II] /Ha between the young stars and 
AGN as dominant ionization sources. Of the three low- 
ionization lines, [O I] is the most sensitive to the shape of 
the ionizing spectrum, requiring a significantly hard radi- 
ation field, i.e., that of an AGN, to sustain a sufficiently 
extensive partially ionized zone in clouds optically thick 
to Lyman continuum, and thus to produce a strong such 
feature. Since the ionization potential of [O I] matches 
that of H very well, large differences in the [O I] /Ha ra- 
tios are expected between H ii region-like objects and 
narrow line AGN. The effect is also important for [S II] 
(A6717-f A6731)/Ha, however, the fact that S"*" can also 
exist within H+ zones of H ii regions and AGN atten- 
uates the difference between the two classes of objects. 
Strong [S II] AA6717, 6731 are nevertheless expected in 
AGN because of the large collisional-excitation cross sec- 
tions for these lines. [N II] has a much smaller excitation 
cross section, and thus its strength in AGN is weaker. 
Large [N II] /Ha line ratios in AGN may however also 
be caused by potential selective enhancements of nitro- 
gen (Storchi-Bcrgmann fc Pastoriza 1989, 1990), and 
thus do not necessarily arise through ionization by an 
accretion-type radiation field. 

Figure 13 illustrates the behavior as a function of 
[S II] /Ha and [O I] /Ha of the objects classified as AGN 
based on [N II] /Ha ratio like in Miller et al. (right pan- 
els), and based on the Kauffmann et al. line that sep- 
arates the two branches of the objects distribution in 
[N II] /Ha vs. [O III]/H/3 diagram (left panels). For the 
Miller et al. classification, the density contours show that 
these AGN are governed by a LINER-like behavior. Us- 
ing the spectral types defined in Table ^ the fraction 
of LINERs is at least 30%, while Seyferts comprise less 
than 9% of these AGN, in both their '2-hne" AGN (only 
detection of 2 line-emissions, [N II] and Ha is required) 
and "4-line" AGN (additional detection of [O III] and H/3 
emission lines is required) samples. Also, the fraction of 
Miller et al. AGN with low [S II] /Ha and [O I] /Ha ratios, 
which are more likely to be dominated by star-forming 
activity, is as high as 20%, which represent more than 
double the number of Seyferts. AGN defined using the 
Kauffmann et al. criteria severely leak into the H ii locus 
in both [S II]/Ha and [O I]/Ha diagrams. 27% of these 
sources, or 45% of the objects situated between the Kew- 
ley et al. and Kauffmann et al. lines (the blue dots in the 
[N II] /Ha vs. [O III]/H/? diagram), would be classified 
as H II using the criteria in Table ^ Seyferts remain 
poorly represented in the Kauffmann et al. sample as 
well (less than 5%), while LINERs, although constitut- 



ing the majority of the potentially accretion-dominated 
sources (25%), are greatly outnumbered by H ii's. 

This comparison shows that the Kauffmann et al. and 
the Miller et al. AGN classifications, although power- 
ful through their great statistics, may not best represent 
accretion dominated narrow-line galaxies. Such classi- 
fications include a significant fraction of galaxies that 
show star-forming activity, and certainly, a mix of dif- 
ferent types of AGN. For the purpose of understanding 
differences between emission-line activity originating in 
star formation and black hole accretion, a more stringent 
characterization of the dominant power source, particu- 
larly as a function of [S II] and/or [O I], is necessary. 

Note that in our classification of the emission-line sys- 
tems we weighted equally the three diagnostic diagrams, 
even if the error distributions in the line ratios vary from 
diagram to diagram. In particular, the errors in the 
[O I] fluxes and [O I] /Ha flux ratios are generally larger 
than those of the other stronger features. However, for 
Seyferts in particular, the [O I] error distributions are 
signiflcantly narrower than those of other object types, 
suggesting that for this particular category the classifi- 
cation is especially trustworthy. 

2.1.2. Our Samples 

We select from the parent sample of galaxies a sub- 
set of strong "emission-line" sources and a set of "pas- 
sive" objects that show insignificant, if any, line emis- 
sion activity. The emission-line sources are galaxies that 
exhibit aU six fines Ha, H/3, [O III],[N II] [S II], and 
[O I] in emission, i.e. the line fiuxes have positive val- 
ues and the fluxes are measured with at least 2a con- 
fldence. We identify 54,410 such strong line-emitters, 
which comprise w 20% of all objects. Note that our 
definition of emission-line objects is very restrictive, and 
therefore the detection fracti on of such sources among all 
galaxie s is smaller than wh at IHo. Filippenko fc Sargent I 
l|1997a|) and IMiller et a,l. 1 (|2003f) find (> 40%). When 
we relax the "6-line" definition criteria to include any 
kind of emission-line activity, we recover a similarly high 
fraction. The "passive" galaxies are defined here as the 
objects for which Ha, H/3, [O II], and [N II] are not de- 
tected in emission (i.e., their EWs are positive in the 
emission-line catalog, indicating an absorption feature). 
This sample, which includes 59,197 sources, is a subset 
of the red, old, non-Ha emitters. 

Among the emission-line galaxies described above, we 
use the criteria listed in Table Qto identify 34,781 H ii- 
type galaxies, 2,392 Seyferts, and 7,468 LINERs, among 
which 2,155 are Transition objects. There are a signifi- 
cant number of strong emission-line objects («18%) that 
do not simultaneously comply with these sets of condi- 
tions: these are objects that show large [N II] /Ha ratios 
(> 0.6) but low [S II]/Ha (< 0.4) and/or [O I]/Ha (< 
0.08), or small [N II] /Ha and large ratios involving [S II] 
and/or [O I]. For the sake of simplicity, we exclude these 
sources from the comparative analysis of clustering of 
Seyfert, LINER and Hii populations. 

Although [O I] is generally weak and therefore diffi- 
cult to measure accurately, this feature is very useful in 
separating the Transition objects among LINERs. Note, 
however, that requiring its detection in the sample def- 
inition severely diminishes the sample sizes. Without 
this condition on [O I] and consequently on [O I] /Ha, 
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we identify 98,556 line emitters, or « 40% of the galax- 
ies, among which 3,052 are Seyferts, 12,779 are LINERs, 
and 65,971 are H ii galaxies. In our clustering analysis, 
we compare the clustering properties of active galaxies 
whose definitions both include and exclude conditions 
on [O I] and [O I] /Ha. 

2.2. Subsample Properties & Selection Effects 

We begin by forming the "Best" absolute-magnitude 
limited sample of galaxies using the definition illustrated 
in Figure ^ then apply the criteria listed in Table to 
form spectroscopically-classified subsamples. The num- 
bers of objects in each subsample are recorded in Table 
12 Figure 01 shows the redshift z distributions for these 
samples and comparisons of their apparent and absolute 
r magnitudes are presented in Figure El 

It is clear that, although drawn from a parent sample of 
galaxies with a fixed range of intrinsic luminosities, sub- 
samples of spectroscopically-classified objects are not, in 
general, volume limited. The necessity of detecting the 
spectral features employed in classification causes a red- 
shift dependence. For example, LINERs and Seyferts are 
less prominent than other types of sources at z > 0.1, 
and seem to populate the brighter galaxies, both in ap- 
parent and absolute magnitudes, while the H ii systems 
are shifted toward fainter apparent and absolute magni- 
tudes. 

Whether a specific galaxy is included into a given spec- 
tral category is strongly subject to multiple selection ef- 
fects that vary with redshift. The fraction of galaxy 
centers exhibiting measurable emission lines decreases 
with redshift as a consequence of decreasing signal-to- 
noise. More importantly, the increase with distance of 
the projected size of the SDSS 3" spectroscopic aperture 
causes dilution of emission-line activity by the increasing 
amount of host galaxy light. Thus, weak line-emitting 
systems are more likely to be misclassified as non-active 
at larger distance. These effects explain the differences 
seen in the redshift and magnitude distributions of the 
accretion type sources (Seyferts, and at least some LIN- 
ERs) and the galaxies in which activity is mainly driven 
by hot young stars (H ii). While in the latter objects the 
power source is more extended, and thus more likely to be 
observed at larger distances, in the accretion dominated 
nuclei the ionization originates in more centrally concen- 
trated regions and thus is less likely to be included in the 
emission-line sample at large z where the signal-to-noise 
is lower. 

Note that for our specific goals of separating as cleanly 
as possible among different types of emission-line sys- 
tems, the fixed aperture biases remain inconsequential. 
Regarding the detection of the AGN dominance, the fixed 
fiber effects go only in one direction: the more distant ob- 
jects are less likely to be detected as AGN as stellar con- 
tamination becomes more pronounced; hence, such cases 
would most likely account for the large fraction of ob- 
jects that remain unclassified based on our criteria, and 
which we exclude from our comparative analysis between 
properties of bona-fide Seyferts and LINERs. Likewise, 
we do not attempt any aperture correction for the line 
fiuxes as estimates of total quantities (like e.g., the star 
formation rate) remain irrelevant for this work. Note 
also that the relative number statistics of various types 
of emission-line systems are indeed expected to be dif- 



ferent for different z bins for a fixed fiber size. However, 
such possible classification biases are minimized through 
our choice of volume-absolute magnitude limited sample 
that spans relatively short ranges in both redshift and 
brightness. 

We find that the different spectral classes of objects 
exhibit distinct behavior in several physical characteris- 
tics. Figure shows the distributions in log ^[oi] cind 
log i[oiii]7 the intrinsic column density Nh, and the elec- 
tron density n^, separately for the Seyferts, LINERs, H ii 
galaxies, and the whole sample of 6-line emitting galax- 
ies. Line luminosities i[oiii] a-nd i[oi] £^re calculated 
only for objects that show fiux measurement uncertain- 
ties of < 50% in these lines and in the Balmer decrement 
Ha/H/?, which is used to correct for intrinsic extinction. 
We assume a dust-free case-B recombination value of 2.86 
for the standard, not absorbed Ha/H/?, and, for the sake 
of simplicity and to ease comparison with previous calcu- 
lations by Kauffmann ct al. (2003), an attenu ation law 
of the form r cx A^^-^ (Chariot & FaF'200^. We use 
the same method for calculating the Nh values. To esti- 
mate we employ the [S II]A6716/A6731 line-flux ratio 
lle.g.. Osterbrocklfl989l) . The [O II]AA3729,3726 feature 
could also be used as a density indicator, as it is avail- 
able in the majority of the SDSS galaxy spectra, and the 
emission-line catalog that we use here includes measure- 
ments of the individual components. However, the small 
separation of only 3A between the two lines may cause 
large uncertainties in their fluxes, and therefore in the rie 
estimates as well. 

Investigation of the L[oiii]-dependent behavior of dif- 
ferent types of galaxies is important because this pa- 
rameter is claimed to trace the AGN activity, at 
least for luminous system s with large [N II] /Ha ratios 
ijKauffmann et af~l 12003 ^1. This work showed evidence 
for the fact that highly [O III] luminous objects are more 
likely to be powered by an accretion-type nucleus, sug- 
gesting that larger i[oiii] may correspond to larger accre- 
tion rates. A scrutiny of the role played by -i'[oi] distin- 
guishing among different spectral types and in their spa- 
tial clustering properties is motivated by the strong ca- 
pability of the [O I] feature to differentiate between ther- 
mal and non-thermal ionization sources. Given that [O I] 
arises preferentially in zones of partly ionized hydrogen, 
that are extended only in objects photoionized by a spec- 
trum containing a large fraction of high-energy photons, 
i.e., originating in accretion, and therefore nearly absent 
in galaxies photoionized by OB stars, one might expect 
that L[oi] is also be a good indicator of the strength of 
the accretion activity. Such expectations arc supported 
by recent work of Constantin ct al. ( 2006) who indicate 
that the presence and strength of the [O I] emission may 
be linked to the presence and strength of the broad Ha 
emission, and thus to the dominance of the accretion- 
type ionization in the nuclear spectra of nearby emission- 
line galaxies. 

Interestingly, in terms of both £[oiii] and i[oi]j LIN- 
ERs and Seyferts show a strongly antagonistic behavior. 
LINERs stand out as the objects with the lowest L[oiii] 
and i[oi] relative to other spectral classes, rarely exhibit- 
ing values of log i[oiii] > 40, with L[oiii] in units of erg 
s~^. Seyferts, on the other hand, are the galaxies that 
are most likely to be luminous in these emission lines; 
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the median values for log i[oiii] are ~38.6 for LINERs, 
and ^39.8 for Seyferts. Although the lack of LINERs 
with high i[oiii] can be explained by their definition as 
low ionization emission regions, which explicitly selects 
lower [O III] fluxes, the absence of high i[oi] LINERs is 
intriguing. Because the redshift and magnitude distribu- 
tions of Seyferts and LINERs are very similar, selection 
effects cannot be attributed to this trend, thus their low 
L[oiii] and low i[oi] must be intrinsic to these systems. 

Dust obscuration is another variable that could signifi- 
cantly affect the detection of AGN-like activity. In terms 
of intrinsic attenuation, as measured by Nh, LINERs are 
again at one extreme, showing low levels of extinction rel- 
ative to Seyferts and H ii's, with the latter being the most 
obscured of the emission-line objects. These trends are 
to be expected, as star formation is generally accompa- 
nied by significant amount of obscuring dust. However, it 
may also be the case that in such a dusty star-forming en- 
vironment, a potentially present weak AGN-like nuclear 
emission is suppressed, because it lies buried in surround- 
ing H II- like features. 

Based on the [S II] line flux ratio estimates for the elec- 
tron density, Seyferts show the highest Tig , cLS expected 
for sources in which the dominant ionization mechanism 
is accretion of matter onto a black hole. The LINERs 
show lower Ue densities, while the starforming systems 
clearly populate the lowest end in the distribution; 
the median rig is w 5 x 10^ cm~'^ , « 1.5 x 10^ cm~^ , 
and w 5 X 10^ cm^^ for the Seyferts, LINERs and H ii's 
respectively. A Kolmogorov-Smirnov test indicates an 
almost null probability that any pair of these subsam- 
ples draw their distributions in [S II] ratio from the same 
parent distribution; the KS probability that Seyferts and 
the whole sample of emission-line emitters belong to the 
same parent population is KSscy/cmiss = 4.24 x 10~^^, 
while for the comparison between the line emitters and 
the H ii's, KSomiss/Hii = 2.42 x 10~^^. There are roughly 
3000 galaxies that manifest emission-line activity with 
He < 10 cm~'^, which is below the theoretical lower limit 
for which rie can be estimated based on [S II] emission; 
about half of these objects are H ii galaxies. 

Median and average values of the parameters for which 
the distributions are compared among different spectral 
types of objects are recorded in Table El 

3. THE CORRELATION FUNCTION 

To quantify clustering in the observed galaxy distribu- 
tion we estimate the 2-point correlation function (CF) 
^(s), which measures the excess probability over random 
to flnd an object pair separated by comoving distance s. 
The separation s is measured in redshift-space, thus it is 
important to note that the shape and amplitude of this 
measure of spatial clustering are determined not only by 
the spatial configuration but also by peculiar velocities. 
Small scale velocity dispersion, e.g., in cores of galaxy 
clusters, damps small scale clustering, while linear infall 
onto dense regions amplifies clustering on large scales. 

Accurate estimation of the correlation function re- 
quires careful accounting for both angular and radial 
variation in the expected galaxy density. The former 
requires knowledge of the complicated geometry of the 
angular mask that describes the regions on the sky in 
which data were obtained and the sampling of galaxies 
in the observed regions, while the latter variation may de- 



pend on physical characteristics of the samples analysed, 
which can introduce important selection biases with dis- 
tance. 

To account for the survey geometry we employ random 
catalogs that describe the SDSS DR2 survey angular se- 
lection function. These random masks are generously 
made publicly available in t he NYU Value-Added G alaxy 
Catalog and described in iBlanton et al. I l)2004|) . The 
random samples we employ in all the ^(s) measurements 
are at least 10 times as large as the datasets involved. 
As discussed in Section 12.1.11 the z distributions for the 
volume limited samples of the different categories of ob- 
jects are not uniform, and are different from sample to 
sample. We therefore build individual radial selection 
functions for the random masks corresponding to each 
sample to be studied; we randomly assign rcdshifts that 
follow distributions constructed based on 2nd order poly- 
nomial flts to the observed object z distributions. We 
have explicitly tested the CF ^(s) estimates for differ- 
ent parametrizations of the redshift distributions N{z), 
and flnd that they do not signiflcantly affect the results; 
the power-law fits, described below, show slopes and am- 
plitudes that remain consistent within the value range 
bordered by uncertainties. 

As discussed above, we do not account for the potential 
biases on small scales introduced by the fiber collisions, 
because we have no reasonable means of doing so. In 
other estimates of galaxy clustering properties, such bi- 
ases have been corrected for by assigning each pair mem- 
ber whose redshift was not obtained because of the fiber 
collision the same z as the pair member whose redshift 
was measured. However, for the purpose of studying the 
dependence of clustering on spectral characteristics we 
cannot apply such correction because there is no physical 
motivation for assigning to the unobserved pair member 
the spectral properties found to characterize the object 
whose spectrum is available. Therefore we treat the fiber 
collisions as a contribution to incompletness in the sur- 
vey; this variation of the completeness with angular po- 
sition of the is accounted for in the random catalogs de- 
scribed above. This procedure leaves a residual effect due 
to fiber collisions that is significant at very small scale, 
i.e., at comoving transverse separations of ~0.14/i~^Mpc 
(the corresponding hnear scale for 55", at the outer edge 
of our samples cz — 50,100 kms~^). We restrict our mea- 
surements to separations > 0.14ft,~^Mpc and note that 
the incompletness biases due to fiber collisions should 
similarly affect all of the subsamples we analyse, and 
thus are not a significant concern for this study because 
our focus is on the relative differences between the CF 
characteristics. 

We calculate the c orrelation functions using the 
iLandv fc Szalav I l)1993j) estimator, 



as) = 1 
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DR{s) 
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RR{s) N RR{s)' 

using bins in pair separation s that are logarithmically 
spaced with a width of 0.33 in log(s/ft.~^Mpc), starting 
from O.Ulh^^Mpc. In this formula, DD{s) and RR{s) 
are the number of pairs in the data and random cat- 
alogs respectively, while DR{s) is the number of pairs 
between the data and the random samples. Pair counts 
are estimated using the computationa l ly effi cient "npt" 
algorithm described in iMoore et al. I l|2000() . We esti- 
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mate the covariance matrices of the uncertainties u sing 
the statistical jackknife method l)e.g.. Lupton lITOQ^ . for 
which we divide our samples into 75 separate regions on 
the sky, of approximately equal area, ^ 10degx5deg, 
and estimate the C F 75 times, each time leaving out one 
of these regions fsee lZehavi et al.H l|2002,l for a more de- 
tailed description of this method and for a discussion on 
its robustness in estimating the uncertainties). 

We quantify the clustering properties of a given sample 
by fitting the measured CF with a power law of the form 

as)^{s/soy\ (3) 

where s denotes the comoving separation between the 
object pairs, sq is the correlation length used to express 
the amplitude of clustering, and the slope 7 quantifies 
the ratio of small to large scale clustering. In the fitting 
process we employ separations 3h~^ Mpc < s < lOh^^ 
Mpc, in order to avoid separations on which the CFs are 
derived from fewer than 20 data-data pairs. 

The CF of the full volume limited "Best" galaxy sam- 
ple is characterized by a correlation amplitude sq = 
7.8h~^ Mpc, and a slope 7 « 1.2. These values are 
consistent with previous calculations of the redshift- 
space clustering of the SDSS main galaxy sample, both 
for magnitude and vo lume limited samples si milarly 
bounded in luminosity ijZehavi et al. ll2002Ll2005|l . 

3.1. Clustering Dependence on Spectral Properties 

We examine here the redshift-space CF for various 
subsamples constructed based on sets of properties that 
reflect differences in the dominant ionization mecha- 
nism, i.e., accretion vs. star-forming type, or provide 
clues to the physical conditions of the line-emitting re- 
gions. We compare correlation functions of different 
spectral classes, and investigate the degree to which other 
emission-line related parameters, e.g., [O III] and [O I] 
luminosities, intrinsic neutral hydrogen column densities 
(Nh), and electron densities n^, may influence the re- 
sults. 

The results of power-law flts to the CFs estimated for 
various subsamples of the parent "Best" absolute mag- 
nitude limited galaxy sample considered in this analysis 
are recorded in Table |21 together with results for the 
CF of all galaxies in the sample. Object sample defini- 
tions as a function of Ue and A'"^- are described in Tables 
13 and 21 It is clear that the clustering properties of 
subsamples of objects governed by distinct spectroscopic 
characteristics are different. In the following subsections 
we discuss these results in detail. 

3.1.1. Spectral Class 

The CFs estimated for emission-line objects, passive 
objects, and the three distinct line-emitting subsets of 
Seyferts, LINERs and H ii's, are illustrated in the upper 
and lower panels of Figure \7\ We present and compare 
the shape and amplitude of power-law fits to the CFs for 
samples whose definition both include and exclude con- 
straints on [O I] and [O I] /Ha, in the right and left panels 
respectively. The CF of the volume limited sample of all 
galaxies is also shown for comparison in all panels. The 
error bars denote the 1-sigma uncertainties. The upper 
panels include insets that compare the 1-sigma, 90%, and 
99% confidence contour levels for the likelihood functions 



obtained by fitting power-law models to the correlation 
functions of actively line-emitting galaxies and the pas- 
sive ones. 

The passive galaxies have a substantially higher am- 
plitude and steeper ^(s) than the emission-line galaxies, 
with a correlation length sq « 11.4/i~^ Mpc, compared to 
So « 6.5h~^ Mpc for the actively line-emitting sources. 
This trend is clearly present whether or not the selection 
of these objects employ the [O I] feature, although the 
datasets are smaller when [O I] is required in defining the 
sample and, therefore, the uncertainties are correspond- 
ingly larger. 

Among the emission-line sources, LINERs exhibit the 
highest clustering amplitude, while H ii's show the weak- 
est clustering, at both small and large scales (see lower 
panels of Figure d) . Seyferts distinguish themselves from 
the other emission-line galaxies by the relatively steep 
slope of their correlation function, 7 > 1.4. This effect 
might arise if the environment of Seyferts causes an ex- 
cess (relative to a fiatter power law) of AGN activity 
on a scale larger than clusters or groups; the power-law 
fits are over separations 3 — 10/i~^Mpc, so the excess 
small-scale clustering is not caused by very close pairs of 
Seyferts. Overall, Seyferts' clustering amplitude remains 
intermediate between that of LINERs and H ii's. 

A more quantitative comparison between the cluster- 
ing properties of Seyferts, LINERs, H ii's, and of the 
whole galaxy sample is presented in Figure |H1 From 
top to bottom, we see that the clustering amplitude 
of Seyferts is significantly smaller than that of the full 
galaxy sample, LINERS are clustered similarly to the 
parent galaxy sample, and HII's are less clustered than 
all galaxies. The left and right sets of panels are for 
AGN sub-samples defined without employing [01] and 
with [01] , respectively. Because this criterion changes the 
number of objects in the sub-samples, as well as chang- 
ing the intrinsic luminosity necessary for this line to be 
detected at fixed distance, we find slight variation in the 
statistical significance of these comparisons. 

That Seyferts are found to be less clustered than the 
average galaxies is an important result, and its statisti- 
cal significance requires some comment. Table |5] shows 
that the clustering amplitudes for Seyferts and for all 
galaxies in the main sample are sq ~ 6.00 ± 0.64 and 
7.80 ± 0.49 respectively, which translates into a differ- 
ence of order 3(t. In the upper panel of Figure |H1 one 
sees that the 2-dimensional " l-c" limits just touch, and 
the 90% contours overlap. This overlap arises because 
the 2-dimensional contours enclose the labeled percent- 
age of the probability in the joint distribution of both 
So and 7, and these elliptical contours extend beyond 
the confidence limits that are computed in one parame- 
ter at a time (by marginalizing over the other parame- 
ter). The natural confidence regions in the 1-dimensional 
parameter space, say so, are the projections of the 2- 
dimensional regions defined by fixed Ax^ into this 1- 
dimensional space of interest. To enclose the same prob- 
ability, i.e., fraction of points, say 68.3% (or Ax^ — 1), 
as the two parameters Sq and 7, the corresponding ellipse 
will correspond to Ax^ = 2.3 as it must necessarily ex- 
tend outside of both of them ^ . The resulting constraints 

^ e.g., see discussion in Chapter 15 of 
Press et al. 1992, "JVumericaJ Recipes in C," 
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on So and 7 are reported in Table [21 

To clarify the statistical significance of the observed 
difference in clustering amplitude of Seyferts and the par- 
ent sample of galaxies, we conduct an additional test. Be- 
cause the Seyferts are a sub-sample of the volume-limited 
sample of galaxies, a simple test of the validity of our re- 
sults is to randomly select sub-samples of galaxies with 
the same number of objects as the Seyfert dataset, com- 
pute the best-fit CF parameters for each, and compare 
the results with the actual Seyfert CF. We form 100 such 
"mock Seyfert" samples for two separate cases: 1) by 
randomly selecting 1000 objects from the parent galaxy 
population, and 2) additionally constraining the mock 
Seyfert sample to have the same redshift distribution as 
the Seyferts. For each mock sub-sample we fit the cor- 
relation functions using the covariance matrix built from 
the variation among the 100 CFs. The results shown in 
Figure El indicate that the CF of the Seyfert sample is 
inconsistent at > 95% confidence with being drawn from 
a parent population of CF's of randomly selected galax- 
ies. In both cases, not a single random sample of galaxies 
has a clustering amplitude sq as low as observed in the 
actual Seyfert sample. 

It is interesting to note that, while for the samples of 
Seyferts and LINERs that use [O I] in their definitions 
the error bars are larger than in the case when this fea- 
ture is not used, the differences in sq and 7 become more 
pronounced, and more statistically significant: LINERs' 
So is in this case indistinguishable from that of the whole 
sample of galaxies, while Seyferts' so is much lower and 
the l-cr contours do not touch anymore. On the other 
hand, so of the star-forming galaxies does not actually 
change between samples that use [O I] or not in their def- 
inition. This is yet another clue to the fact that the [O I] 
emission is not an important characteristic of the H 11 
galaxies, while it certainly makes a difference in defining 
more strictly AGN, i.e., Seyferts and LINERs. 

Interestingly, regarding the CF characteristics of the 
data samples of different spectral types we find a slight 
tendency toward steeper slopes, as compared to that of 
the galaxies' CF. Note also that the correlation func- 
tions of the sub-samples do not necessarily average to 
the correlation function of the whole sample; the CF of 
the whole sample is a weighted sum over the correlations 
of the sub-samples and the cross-correlations between the 
sub-samples. It could be the case that the CFs of the sub- 
samples are steeper because certain environments favor 
the formation of particular species of AGN. In an ex- 
treme example, if Seyferts, LINERs, and H 11 galaxies 
came in clumps of one type only, they would be strongly 
clustered on small scales, still clustered on larger scales, 
but have a relatively steeper correlation function because 
of the extra small-scale clustering. In contrast, the cross- 
correlations between sub-samples, e.g. between Seyferts 
and H II galaxies, would have very little small scale clus- 
tering because they would be mutually exclusive on the 
scale of the clumps, thus the cross-correlation functions 
would be shallower than that of the whole sample. 

In order to examine our results in the context of pre- 
vious work on clustering of AGN observed by the SDSS, 
we also examine the CF of the narrow-line AGN vs. non- 
AGN (i.e., star- forming sources) using the classifications 

|http://www. Iibrary.cornell.edu/nr/bookcp df.html| 



employed by Kauffmann et al., and Miller et al. Fig- 
ure mi shows the CFs estimated for such subsamples 
of the "Best" volume limited galaxy sample and com- 
parisons of the confidence contours of power-law fits to 
these CFs. The galaxy CF is also shown for compar- 
ison. The fitted power-law parameters and number of 
objects in each sample are also listed in Table 12 Here 
we find that AGN samples selected using the criteria ap- 
plied by Kaufmann et al. and Miller et al. show very 
similar clustering amplitudes to that of all galaxies, as 
claimed in previous studies based on estimates of pop- 
ulation fractio ns in cores of gala xy clusters and intra- 
cluster fields llMiller et al. ll20?)3^ ■ or measurements of 
CFs (Wake ct al. 2004). This result is not surprising. 
We show in Section I2.1.1l that AGN samples selected us- 
ing these criteria are dominated by LINERs. We also find 
that LINERs, as a spectroscopically-distinct population, 
are indeed clustered similarly to normal galaxies. Thus, 
the resemblance between the spatial clustering properties 
of such AGN samples and normal galaxies reflects dom- 
inance of LINERs in these AGN definitions, and it does 
not apply to AGN in general, particularly not to Seyferts 
as a separ ate class. Similar trends were reported before 
by Kauffm ann et al.~l l|200 4f) who also note that the envi- 
ronmental effects in the Miller et al. sample are likely to 
be dominated by LINERs. We show here with great con- 
fidence that the environments and the large scale struc- 
ture of weak and powerful AGN are distinct, and that if 
these systems are not carefully separated out, one can er- 
roneously characterize the AGN population when instead 
refers to LINER-like objects only. 

The results presented in this section indicate clearly 
that bona fide narrow- line low luminosity AGN, i.e., the 
Seyfert 2s, are less spatially clustered than typical galax- 
ies. These findings have important implications for the 
environment of the host galaxies of AGN. The statistics 
of peaks in a Gaussian random field suggest (and results 
of fully non-linear N-body simulations demonstrate) that 
a large clustering amplitude is indicative of a popula- 
tion of objects that inhabit high peaks (dense regions) 
and, conversely, that a low clustering amplitude implies 
a sample that probes lower peaks of the field and less 
dense environments. Thus, we infer that Seyferts and 
H II galaxies tend to prefer less crowded environments 
than typical galaxies, while LINERs seem to follow the 
galaxy spatial distribution, as their correlation function 
is, within the uncertainties, identical to that of the whole 
galaxy sample. The cause of the relatively steeper slope 
of the CF of Seyferts is unclear, but it may be further 
evidence of a special environment for promoting their ac- 
tiv ity. Our analy s es see m to confirm the trends reported 
by iMiller et al. I l|2003^ , which show a the predilection 
of the star-forming galaxies to reside in more sparsely 
populated regions, and of the passive, non-line-emitting 
galaxies to live in more dense environments. 

3.1.2. [0 III], [O I] line luminosities 

Figure El shows the CF of subsamples selected by lu- 
minosities of the forbidden narrow emission lines [O III] 
and [O I]. We refer to objects that have log i[oiii] > 40 
and log L[oiii] < 37 as the high and low i[oiii] sam- 
ples respectively. Line luminosities arc in units of erg 
s^^. Conforming with the definitions listed in Table (21 
a similar sample name designation is used for the low 
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and high i[oi] object samples. Our calculations show 
that objects that are bright in [O III] and/or [O I] are 
significantly less clustered than sources that are less lu- 
minous in these lines. We confirm the depen dence of the 
cluste ring amphtude on £[oiii], as seen bv IWake et al. I 
l)2004() . For the first time, we detect an even stronger 
variation of clustering with i[oi]- 

For objects with relatively high [N II] /Ha ratios (above 
the Kewely et al. curv e). [O III] is, as discussed by 
iKauffmann et al.1 l)2003(l . only weakly affected by resid- 
ual star-formation, and therefore capable of distinguish- 
ing line emission activity due to accretion from that 
originating in star formation. As illustrated in Figure 
El using our spectral classification, the high i[oiii] ob- 
jects include most of the Seyferts. Interestingly, the 
subset of luminous emission-line galaxies includes almost 
no LINERs, which clearly dominate the low luminosity 
end of the distribution. This trend supports and justi- 
fies furthe r the physics b ehind the correlation found by 
IKauffman n et al. 1 1|200.'^ between the ionization state of 
the AGN, as expressed by their position angle in the 
[N II] diagnostic diagram, and the AGN [O III] luminos- 
ity. It is important to note that, because the number of 
Seyferts remains small in general, H ii's remain the dom- 
inant type in the high ipiiij subset, comprising ~ 40% 
of these objects; the small fraction of H ii's with high 
^[Oiii] makes a large contribution to this subsample and 
so complicates interpretation of our estimates of the CFs. 
For the sake of compariso n, note also that str ictly among 
AGN classified based on lWake et al. I l|2004D definition, 
that implicitely exclude H ii's, the highest third in L[oiii] 
remains still weakly represented by bona-fide AGN: the 
Seyferts make up only 30% of the objects, while the rest 
of them are either LINERs or unclassified emission-line 
objects. 

The CF of the high L[oiii] systems shows a low clus- 
tering amplitude, as expected from the properties of the 
objects that comprise the sample. Both Seyferts and H 
Ii's are weakly clustered, with amplitudes that are simi- 
lar, within the uncertainties. The steep slope of this CF, 
7 w 1.4, suggests a strong influence in behavior from the 
Seyferts. The low iioiii] sources exhibit a CF that is 
consistent in both the amplitude sq ~ 8.8 and the slope 
7 « 1.3 with what we measure for the LINER sample. 
Such strong correspondence may indicate that low levels 
of activity, either originating in star formation or accre- 
tion, is predominantly present in highly clustered hosts, 
and thus more probable in crowded environments. 

The variation in the clustering amplitude between the 
samples of low and high i[oi] is very strong, showing 
a difference of ~ 6a. As with selection by iv[oiii], the 
low i[oi] objects have large clustering amplitudes, while 
the more active sources remain weakly clustered. The 
dominant majority of low ^[oi] objects are LINERs, sug- 
gesting again that galaxies that manifest the lowest level 
of activity are the ones that are strongly clustered. LIN- 
ERs remain very rare (less than 5%) in the high ^[oi] 
sample. The high i[oi] subset is highly populated by 
Hii's (~ 60%), and, although this dataset includes the 
majority of Seyferts, such systems make up only ~ 9% 
of the objects. Consequently, the clustering properties of 
the high L[oi] objects are close to those of Seyferts and 
star-forming galaxies, revealing a relatively small clus- 



tering amplitude. 

As discussed in Sections 12.1.11 and 12.21 a strong link 
is expected between the strength of the [O I] emission 
and that of the accretion process. Thus, the clustering 
dependence on i[oi] may be another indication of the 
relationship between the level of AGN activity in galax- 
ies and the properties of their host dark matter halos: 
the active nuclei that exhibit the lowest levels of activity 
due to accretion are the most clustered, and thus, the 
most massive ones. Any signature of star-formation ac- 
tivity is feeble, if present at all in these systems; although 
numerically dominant among the emission-line systems, 
the H ii-like galaxies are significantly underrepresented 
among the low luminosity objects (see Figure EJ. On the 
other hand, the most active sources, whether in terms of 
star-formation or accretion, are weakly clustered, thus 
we infer that these sources are hosted by relatively less 
massive systems. 

3.1.3. Gas Density & Intrinsic Extinction 

As Figure EliUustrates, LINERs stand out among line- 
emitting objects in other aspects as well. Their electron 
densities are lower than those of Seyferts but generally 
higher than in H lis, and they show little intrinsic obscu- 
ration. A clear segregation of LINERs and Seyferts as a 
function of these parameters, similar to that as a func- 
tion of i[oi] and i[oiii], is possible only for very small 
subsamples, and thus CF estimates remain impractical. 
Nevertheless, an investigation of the degree to which the 
clustering amplitude depends on rig and Nh remains 
of interest even if the subsamples characterized by low 
and high values of these parameters comprise a mix of 
spectral types. Such analysis offers additional insights 
into the origin of the clustering differences exhibited by 
sources with contrasting physical conditions, especially 
those that may characterize the availability of fuel for 
accretion or star-formation. 

Figure El shows a comparison of the CFs estimated 
for subsamples characterized by low and high values of 
TT-e and Nh, with the sample definitions given in Tables 
and 01 Consistent with the trends found so far, the 
low rie subsample, in which 70% of the objects are H ii 
galaxies, has a significantly lower clustering amplitude. 
Interestingly, the high Ue subsample shows a high cluster- 
ing amplitude that is consistent with that of the LINERs 
and the average galaxy sample even though it comprises 
a generously mixed population of emission line systems: 
40% LINERs, 18% Seyferts, 19% Hiis, and 23% unclas- 
sified line-emitters (that do not comply simultaneously 
with all the three diagnostic diagrams, see Section I2.1.1|l . 
It is striking that only a slight number dominance of 
the LINERs in this subsample of sources with relatively 
high Ue leads to such results. This trend suggests that 
the moderately high rie unclassified emission-line objects 
have clustering properties more similar to LINERs than 
to Seyferts or Hiis. 

The subsamples split on Nh shows clustering trends 
that are consistent with the results presented so far. 
There is a slight suggestion that the less obscured sources 
{Nh < 10 X 10^0 cm-2), in which the LINERs are in 
majority, are more clustered, while the highly obscured 
systems, which are mostly star-forming objects, are less 
clustered. These differences reflect once more a potential 
link between the amount, nature, and availability of the 



10 



fuel that powers the galactic activity and their clustering 
characteristics: the dusty spirals that usually live in less 
crowded environments are more likely to harbor H ii's 
in their centers, while the nuclei of ellipticals would be 
generally dust-free due to ram-pressure stripping in their 
more crowded, cluster-like habitats, and thus would show 
LINER-like activity and physical properties. 

3.2. CF Estimates and Host Galaxy Properties 

Studies based on large redshift surveys, 2dF and SDSS 
in particular, have clearly shown that spatial clustering 
of galaxies depends on galaxy structure (or morphology) 
luminosity, and internal star form ation history, as mea- 



surcd by spect ral typ e or colors (Norberg_g^^Lj "2^^, 
[2002; Zchavi et al. 11200 2. 2005; Budava ri et al. 1 12003 : 
iMadEfwick et al. II200.1D . The sense of this dependence 
is that more luminous, redder, and earlier type galaxies, 
which also harbor older stellar populations, are generally 
more clustered than the rest of the galaxies. 

The well-described dependence of clustering on galaxy 
properties might explain some of the difference we see 
between the clustering amplitude of the H ii's, which 
seem to prefer bluer, later-type hosts, and that of AGN- 
like objects (i.e., Seyferts and LINERs), which are more 
likely to be hosted by red, early type galaxies. As illus- 



trated in Section 12.1.11 AGN samples defined based on 
simple classification schemes are dominated by LINER- 
like behavior and are therefore more clustered than the H 
II systems (i.e., Figure I10|l . The median values of con- 
centration indices, colors, and other parameters of the 
host galaxies of our spectroscopically-classified subsam- 
ples (see Figure ^1 and Table |SJ) clearly illustrate the 
distinct preferences of the stellar and non-stellar photo- 
ionized emission-line activity for different host types. 

The variation of clustering among Seyferts, LINERs, 
and H II systems is not, however, so easily explained. 
When we use more detailed spectroscopic classifica- 
tion to separate Seyferts and LINERs, we find that 
they differ significantly in their clustering properties 
despite our finding, consistent with previous studies 
ije.g.. Ho. Filipp enko fc Sargent lllQQfhl) . that the large- 
scale, global properties of their hosts are quite similar. 
Although there is a slight shift of the Seyferts toward 
bluer and later type hosts than LINERs, comparison us- 
ing a Kolmogorov-Smirnov (KS) test shows that their 
distributions in concentration index, C, and color u — r 
are consistent with being withdrawn from the same par- 
ent population {KSg^^ = 0.104, KS^"^"^ = 0.617). LIN- 
ERs' host morphology (C) and color {u — r) distributions 
are quite different from those of the whole galaxy sample, 
both in the shapes and median values of these distribu- 
tions (KS^,f_^ = 4.8e-14, KS^J^Z^ = 5.6e-ll). Yet, the 
clustering amplitudes of LINERs and typical galaxies are 
very similar. Likewise, Seyferts and the passive galaxies 
are close to sharing the same parent distributions in host 
properties (KS^^,^,^^ = 0.108, KSp^'^..^ = 4.2e-4), 
although their clustering properties are markedly differ- 
ent. Furthermore, the clustering amplitudes of Seyferts 
and H Ii's are quite similar, but their galactic hosts are, 
beyond any doubt, very different in type. 

To summarize, using broad photometric morphologi- 
cal measures (e.g., concentration index and color), we 
find no evidence that the difference in clustering ampli- 



tudes of different spectral classes of galaxies, and in par- 
ticular Seyferts and LINERs, is driven by the density- 
morphology relation. 

In the light of the effects the host characteristics have 
on clustering, the differences in sq exhibited by the low 
and high ^[oi] ^-nd -/^[oiii] samples merit further attention 
as well. As it can be seen from Figure these datasets 
are clearly distinguishable in their median concentration 
indices and u — r colors, with which the clustering am- 
plitudes seem to correlate. The sources with low [O I] 
and [O III] luminosities are also the reddest u — r k, 2.8 
and generally early in their morphological type, as re- 
vealed by their high median value of C « 3, and present 
strong clustering. The most luminous sources in [O I] and 
[O III] are bluer (w — r « 2.3) and of later type morpholo- 
gies (C « 2.6), and are weakly clustered. Thus, it may 
be apparent that the different clustering amplitudes for 
the high and low luminosity objects is at least partially 
driven by the morphology-density relation that galaxies 
usually obey. However, based on exhaustive studies of 
the color and host type dependence of the correlation 
function, iZehavi et al. I l|20 02. 2005) reveal differences 
in So of only $2/i~^ Mpc, which are smaller than those 
between our low and high i[oiii] systems, and especially 
those we observed between the objects with high and low 
L[oi] (see Table EJ. Moreover, our subsamples span a 
significantly reduced range of values in their median C 
and u — r than those employed in the Zehavi et al. com- 
parisons, suggesting that, the morphology-color effect on 
the discrepancy in sq measured in our samples should 
be even smaller. We thus conclude that the differences 
in the clustering amplitudes of objects that differentiate 
themselves through their [O I] and [O III] luminosities 
are primarily constrained by these parameters, and only 
secondarily by their host properties. In the following sec- 
tion we discuss various scenarios that may explain such 
effects. 

4. DISCUSSION 

4.1. AGN Fueling & Life Gycle 

Under simple assumptions, the amplitude of the 
correlation function can be used to estimate the 
typical mass o f the dark matte r halos in which the 
objects reside ij^. ^. K^i^e r 111984 : Grazi an~et al. ll2?)(il: 
iMagliocchetti et al. II2004D . and f or AGN in p articular, 
to es timate their typical lifetime ijMartini fc W cinbcrgi 
|2001[1 . If, as expected in the hierarchical picture of 
structure formation, the most massive systems are more 
clustered, our results suggest that Seyferts live in less 
massive halos than those that host LINERs. The empir- 
ical relation between black hole mass ^/bh f-nd galaxy 
mass Mdm l|Ferrarese 1120021 iBaes et al. 112001) . that is 
based on the Mbh — cr* relation whi ch seems to hold 
similarly for both inactive galaxie s ijGebhardt et al. 1 
I2000al iFerrarese fc Merritt ' '2000^ an d AG N hosts 
llGebhardt et al. I 2000b.; Fc rraresc eTan 1200 ll 
■Nels on et al. 1 120041 To^ken et al. I l2004t IGreene fc Ho I 
12OO50, implies that the black holes harbored by these 
galaxies must scale in the same manner. Thus, Seyferts' 
black holes must be smaller than those that live 
within LINERs. Via direct comparisons of 2PCF for 
simu lated dark m a tter a nd for observed SDSS galax- 
ies, I Wake et al. I l)2004D reached similar conclusions 
regarding the masses of the black holes. 
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Together with the different trends we see in the phys- 
ical properties of active galaxies with distinct spectral 
classification, this possible distinction in their BH mass 
offers the missing link in our understanding of their in- 
trinsic nature. The high i[oi] ^nd £[oiii] of Seyferts 
suggest that their smaller BHs are accreting at relatively 
higher rates or maybe more efficient than the larger BHs 
whose activity is classified as LINER-like. The low ac- 
tivity level of LlNERs' massive BHs may well be a con- 
sequence of the fact that these systems have less fuel 
available for accretion. Both the electron density and 
column density of surrounding (dusty) material are sig- 
nificantly lower in these objects, suggesting a scarcity of 
matter supply. Seyferts, on the other hand, show the 
highest values of Ue and a relatively large range in Nh 
suggesting that in these sources fuel can easily be pro- 
vided at levels sufficient to sustain accretion at modest 
levels. 

In this scenario, Seyferts grow their BHs at a rela- 
tively fast pace, and this activity would be expected to 
last for short periods of time. Such a highly active phase 
would end with the exhaustion of surrounding accretion 
fuel, when the BH mass reaches levels similar to those of 
LINERs, after which weak (or inefficient) accretion can 
continue for much longer time. This picture is consis- 
tent with the relative numbers of Seyferts and LINERs. 
If Seyferts are simply the manifestation of the stronger 
but short accretion phase, then their numbers must be 
small, especially when compared to those of LINERs that 
subsist much longer at low activity levels and thus are 
more common among active galaxies. The results of our 
AGN classification, together with other previous stud- 
ies discussed above (see section I2.1.1|l indicate indeed 
that LINERs numerically dominate the AGN samples, 
as the LINER-type emission is found in at least three 
times more galaxies than Seyfert-like activity. 

If LINERs harbor more massive black holes then, as 
suggested by the close connection between BH mass and 
bulge size in their host ga laxies ije.g.. Maeorrian et al. I 
ll998HGraham et al. 120?^ . LINERs' hosts must be more 
bulge-dominated than those of Seyferts. Although we 
have seen that the Seyfert and LINER overall distribu- 
tions in C and color are very similar, the median values 
show small differences (see Table indicating slightly 
higher C and redder u — r colors for LINERs, consistent 
with the expected trend. Future larger SDSS samples, 
together with more suc cessful morphology classification 
schemes ijPark fc Ghoi lf2005,') should allow more detailed 
investigations of such feature. 

4.2. The H II & the Transition Systems 

Further, we may extrapolate this picture to galaxies 
whose nuclear activity is dominated by star-formation, 
the H II systems. Assuming that the same scaling rela- 
tions hold for these sources, the weak clustering of H ii's 
implies that they harbor relatively small black holes in 
their centers. In general, they show moderately low, qui- 
escent [O I] and [O III] luminosities. However, because 
they are very numerous, the high luminosity tail of their 
line-emission distribution dominates the statistics of the 
sub-sample of luminous actively line-emitting galaxies. 
There is a generally high level of obscuration in these 
objects, as shown by their large column densities Nh- 
The overall low rie values that characterize the emitting 



gas in H ii's suggest that fuel for eventual accretion is 
tenuous in these galaxies. Thus, if their BHs are accret- 
ing, the rates must not be high or, if the BHs are actively 
accreting, the the resulting radiation is highly obscured. 
In other words, even if the kinetic energy output from 
the central few tens of parsecs is as strong as that of 
the Seyferts, there is only a very small fraction that gets 
transferred to the thermal plasma to produce the en- 
hancement in the optically coUisionally excited lines, at 
the level seen in the accretion-powered Seyferts or LIN- 
ERs. Gonsequently, any accretion signature due to ro- 
bust activity would be sub-dominant in the H ii's spectra 
because it would be buried in the surrounding emission 
originating in stellar heating. 

We now turn to consider the Transition objects using 
similar arguments. Figure El illustrates the GFs esti- 
mated for the Transition and Pure LINER samples (see 
definition in Table ^ along with a comparison of the 
distributions in C and the u ~ r color for these sam- 
ples. Figure 1161 illustrates how Transition systems and 
the Pure LINERs compare in terms of i[oi]j ^[oiiiji Nh, 
and Ue- The results of the power-law fitting of the corre- 
lation functions, that are also recorded in Table El show 
a great discrepancy in the clustering amplitudes of these 
two sub-categories of LINERs. The Pure LINERs, which 
constitute the majority of the LINER sample, remain 
strongly clustered with an amplitude similar to that mea- 
sured for the whole LINER sample and consistent with 
that of the whole galaxy sample. The Transition objects 
are clearly less clustered than the Pure LINERs, and 
are comparable in their clustering amplitude to Seyferts 
and/or H Ii's. 

As expected, the intrinsic properties of the Transition 
objects suggest a tendency toward a more Seyfert or 
H ii-like behavior. These systems show systematically 
larger i[oi] and i[oiii]j and higher Nh than the Pure 
LINERs. However, the nature of the Transition objects 
continues to remain ambiguous. Their lower rig makes 
them more similar to the star-forming systems than to 
Seyferts, while their host galaxies remain in general of 
early-type, in the range of those that usually harbor 
Seyferts, not H Ii's. Their clustering amplitude is sig- 
nificantly lower than that of the typical red, early type 
galaxies. Thus, a possibility for the power source in these 
objects is a small black hole that is accreting at rates that 
are higher than those in LINERs but still low, seemingly 
due to the general fuel deficiency. Given that the obscu- 
ration remains relatively low in these sources, moderate 
activity originating from such a source could explain their 
intermediate behavior between that of traditional AGN 
and H II systems. 

Our own Milky Way seems to provide the most nearby 
example of a typical H ii or H ii/LINER transition ob- 
ject, as outlined above. There is compelling evidence for 
the presence of a re lativelv low mass Galactic black hole 
()e.g.. Schdel et al. I l2003: Ghcz 2004) that is accreting, 
as shown b y radio (Sgr A* so urce, Balick & Brown 1974) 
and X-ray ()Baganoff et al. I t2003'l emission, and which 
exhibits large circumnuclear extinction {Ay ~ 31 mag; 
Rieke, Rieke & Paul 1989). In terms of its nebular lu- 
minosity, surface brig htness distribution, and ionization, 
iShields et al. I l)2005 f> reason that the Galaxy may be a 
typical example of a H ii or maybe a transition system, 
although its Sbc Hubble type is most consistent with the 
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former type of emission-line object. 

4.3. Environment & AG N Activity 

The importance of the role that interactions with other 
galaxies plays in triggering efficient gas fueling in AGN 
remains a matter of debate. Although there is some ev- 
idence that the frequency of occurrence of Seyferts in 
pairs and groups is higher than among isolated galax- 
ies iKelm et al. 1998, 2004), no clear relationships are 
found between the presence of AGN and detailed mor- 
phological properties that are signatures strong interac- 
tions (e.g., bars, see Keel 1996, Ho et al. 1997). 

A problem with many previous investigations of the 
environments of AGN is that their definition as a class 
relies on relaxed definitions of such objects which result, 
as illustrated in Section 12.1.11 in samples dominated 
by LINERs. The influence of Seyfert-like behavior re- 
mains negligible in such studies due to the small frac- 
tion of such objects. For this reason, results that in- 
fer that AGN environments are undistinguishablc from 
those of the norma l, inactive galaxies ( Mille r et al. 2003; 
iWake et al. I2004j) apply for LINERs, but not necessarily 
for Seyferts. We find that LINERs and Seyferts differ not 
only in their intrinsic physical characteristics (ionization 
level, obscuration, density of their line-emitting gas) but 
also in their clustering properties. It is also important to 
understand that environmental properties may vary with 
spatial scale. For instance, the unusually steep slope of 
the Seyferts' CF suggests that these sources may present 
extra clustering at small scales, despite a generally low 
clustering amplitude at scales > 10h~^ Mpc. Thus, it 
is possible that LINERs and Seyferts share similar small 
scale habitats, while their large scale environments are 
quite different. 

To compare the small-scale environments of different 
classes of AGN, we examine local galaxy density on a 
scale ~ l/i~^Mpc. For the spectroscopic s amples we built 
and a nalyzed in this study, we employ iBlanton et al. I 
l)2005tl galaxy overdensities estimated by counting all 
galaxies around each object in cylinders lh~^ Mpc in 
radius and 1,600 km/s long (in the redshift direction) 
centered on the source. The tracer galaxies are a volume- 
limited sample with 0.017 < z < 0.082, and -23.5 < 
Mr* < —19.5. We find that the median overdensities 
for Seyferts, LINERs and H ii's are 6.15, 6.13, and 4.17, 
respectively, suggesting more crowded environments for 
accretion-powered AGN than for objects with line emis- 
sion dominated by star formation. The passive galaxies 
show the largest median overdensity, 9.39. For compar- 
ison, the median overdensity for the whole galaxy sam- 
ple is 6.34. These results support our inferences from 
estimates of the CF that suggest a higher density envi- 
ronment for passive galaxies and a relatively lower local 
density around the H ii's,. The similarity in galaxy den- 
sities around Seyferts and LINERs on this small scale 
may be a consequence of the Seyferts' extra cluster- 
ing at small scales; the density estimates characterize 
the environment only at scales ^ lh~^ Mpc, while the 
clustering amplitudes, which show LINERs to be more 
strongly clustered, are obtained by fitting the CFs at 
scales 3 < r < 10h~^ Mpc. 

If the typical environments of LINERs are no dif- 
ferent tha n those of galaxies, then m ajor mergers, ha- 
rassment ijLake. Katz fc Moore |[T99^ . or other cluster- 



related mechanisms are unlikely to be the trigger for fu- 
eling nuclear activity in these objects. As discussed by 
iWaskett et al. I l)2005j) . because the number of close com- 
panions of modest luminosity AGN (i.e., LINERs) is very 
simila r to that of non-active galaxies (Was kett et al. I 
2005; Grogin et al. ii.200 3'). and because their spatial dis- 
tribution is unbiased relative to that of the normal galaxy 
population (Miller et al. 2003, Wake et al. 2004, Section 
l3.1.1l of this study), a possible cause of activity in these 
AGN may be minor mergers. 

Seyferts, on the other hand, might still be caused 
by, e.g., galaxy harassment, with these more luminous 
AGN being relatively recent additions to clusters, and 
thus situated either at the periphery or in the process 
of falling into dense regions. A clumpy small-scale spa- 
tial distribution would then be a natural consequence 
of such environments. Such migrants are also more 
likely to have large gas reservoirs and only slightly dis- 
turbed morphologies. We have shown that Seyferts 
are distinct from other emission-line systems in their 
high electron densities. So far, morphological stud- 
ies of Seyfert hosts reach no clear, statistically signif- 
icant conclusion on the degree of harassment in com- 
parison with those of the normal galaxies: Seyferts' 
hosts exhibit similar morphologies with those of field 
galaxies (jie Robertis et al. 19 98a b), do not particu- 
larly show evidence for bars ife.g.. Regan fc Mulcheavl 
Il999t iLaine et al 1 12002|) or for galaxy-galaxy interac- 
tions l|e.g.. Malkan et al~l I1998D , while results of pair 
counting in both optical and IR, remain inconclu- 
sive, som e studies finding possible excess of companions 
i Dahari | pi)84; RafancUi et al. 1995 ), while others not 
iFuentes-Wilhams fc Stocke . .1988: .Laiirikainen fc Salo I 
Il995t iSchmittI l2001() . Minor mergers that leave no or 
very little optical trace ije.g.. Corbin Il2063 l might how- 
ever explain fueling that would originate from a canibal- 
ized secondary, less massive galaxy that thus causes only 
minor perturbations to the primary galaxy. Such events 
might as well happen at the galaxy clusters' peripheries. 

5. SUMMARY & CONCLUSIONS 

We investigate the spatial clustering of low-luminosity 
AGN in the nearby universe, using spectroscopic clas- 
sification and measurement of related physical param- 
eters to examine this clustering as a function of their 
nuclear emission properties. We estimate and compare 
the CFs for sub-samples selected on these various proper- 
ties, and analyze these results in light of the well-known 
dependence of clustering on halo mass, as well as the 
empirical relation between black hole mass and galaxy 
mass. This study reveals a strong connection between 
the galaxy clustering amplitudes, hence the properties 
of their dark matter halos, and the intrinsic physical 
properties of these objects' line-emitting regions. A brief 
overview of our main results is given in Table El and we 
list our conclusions as follows. 

(i) The amplitude of the two-point correlation func- 
tion is strongly dependent on galactic nuclear emission 
properties. By employing detailed spectroscopic classifi- 
cation schemes we find that, contrary to previous claims, 
bona fide low luminosity AGN, spectroscopically classi- 
fied as Seyfert galaxies, are clearly less clustered than the 
average galaxies, and therefore do not tracing the same 
underlying structures. LINERs, on the other hand, ex- 
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hibit a clustering amplitude higher than that of Seyferts, 
and consistent with that of the whole galaxy sample. 

(ii) AGN defined using only the [N II] /Ha vs. 
[O III]/H/3 diagnostic diagram are dominated by the 
LINER-like objects, and include a significant fraction of 
objects with conspicuously strong nuclear star-formation 
activity. Two, or better, three-dimensional diagnostic 
diagrams that employ forbidden lines whose emissivity 
is less sensitive to abundance effects, like [S II] and/or 
[O I], are necessary to accurately separate sources pow- 
ered by accretion from those in which star-forming ac- 
tivity is dominant. 

It is, therefore, important to emphasize that more 
lenient AGN definitions are strongly biased toward 
LINER-like behavior, and consequently, do not repre- 
sent the traditional AGN activity and physical charac- 
teristics. LINERs differ from Seyferts in many of their 
intrinsic characteristics. In LINERs, the emitting gas 
density is lower than in Seyferts, and the obscuration is 
generally at very low levels. As suggested by the large 
differences we observe in the [O I] and [O III] line lumi- 
nosities, the accretion activity is significantly reduced in 
LINERs while in Seyferts is generally high. 

(iii) Emission parameters such as i[oiii] and, in par- 
ticular, L[oi], which are sensitive to the level of nuclear 
accretion activity, play an important role in differenti- 
ating systems of different clustering properties. Highly 
luminous and, therefore, more active objects are signifi- 
cantly less clustered than faint, less active ones; the latter 
types are strongly represented by LINERs. 

(iv) The amount and nature of the fuel available for 
nuclear activity, whether accretion or star-formation, is 
also connected to the spatial clustering characteristics. 
The less obscured systems and the galaxies with mod- 
erately high emitting gas densities are more clustered. 
Such sources are highly represented by LINERs. 

(v) Host galaxy properties and, consequently, the 
morphology-density relation, do not strongly influence 
the difference in the spatial clustering found between 
Seyferts and LINERs, and play only a secondary role in 
the large differences between the clustering amplitudes 
of the low and high i[oi] and/or i[oiii] subsets. 

(vi) Based on the strong relationship between host halo 
mass, as given by the clustering amplitude, and the spec- 
tral properties of the line-emitting galaxies, we specu- 
late that: 1) Seyfert activity arises from active accre- 
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TABLE 1 

Object Sample Definition: Spectral classes. 



Sample 
(1) 


(2) 


[O III]/H/3 
(3) 


[N II] /Hq 
(4) 


[S II] /Hq 
(5) 


[O I]/Hq 
(6) 


Emission-line 


54,410 


> 


> 


> 


> 


Scyferts 


2,392 


> 3 


> 0.6 


> 0.4 


> 0.08 


LINERs 


7,468 


< 3 


> 0.6 


> 0.4 


> 0.08 


...pure liners 


5,313 


< 3 


> 0.6 


> 0.4 


> 0.17 


...transition 


2,155 


< 3 


> 0.6 


> 0.4 


> 0.08,< 0.17 


H II's 


34,781 


> 


< 0.6 


< 0.4 


< 0.08 



Note. — Columns (3) to (6) indicate the range of values in the line flux 
ratios. 



TABLE 2 

Absolute Magnitude Limited Correlation Function Samples 



Sample 




so 


7 




all galaxies 
passive 


45,303 
6,383 


7.80±0.49 
11.42±1.17 


1.17±0.05 
1.28±0.07 


-0.82 
-0.82 


Emission-line* 


12,676 


6.33±0.94 


1.30±0.16 


-0.15 


Scyferts^ 
LINERs'' 


829 
3,263 


5.67±0.62 
7.82±0.64 


1.56±0.17 
1.39±0.09 


-0.58 
-0.53 


...pure LINERs 


2,424 


7.39±0.67 


1.33±0.11 


-0.46 


...transition 


839 


5.38±0.71 


1.35±0.23 


-0.18 


H IPs* 


6,622 


5.81±0.53 


1.28±0.11 


0.08 


Emission- line'' 


21,497 


6.45±0.24 


1.24±0.04 


-0.67 


Seyferts'' 


1,018 


6.00±0.64 


1.41±0.15 


-0.48 


liners'" 


5,326 


7.26±0.61 


1.29±0.08 


-0.73 


H Il'st- 


12,649 


5.73±0.24 


1.22±0.06 


-0.46 


AGN (Kauffmann ct al.) 
non-AGN (Kauffmann ct al.) 
AGN (Miller ct al.) 


14,421 
13,630 
9,846 


7.44±0.37 
6.26±0.69 
7.01±0.43 


1.25±0.07 
1.24±0.11 
1.27±0.06 


-0.44 
-0.52 
-0.63 


non-AGN (Miller et al.) 


15,178 


5.62±0.59 


1.24±0.12 


0.03 


log ^[OIII]'' > 40 


2,041 


6.22±0.57 


1.41±0.11 


-0.24 


38 < log L[oiii]" < 39 
log i[oiii]" < 37 


10,464 


7.17±0.44 


1.21±0.05 


-0.63 


1,580 


8.79±0.75 


1.31±0.10 


-0.54 


log i.[oi]" > 39 
log L[oi]" < 38 


4,561 


6.11±0.27 


1.34±0.07 


-0.36 


2,153 


13.15±1.09 


1.33±0.07 


-0.77 


high Nh 
low Nu 


2,575 
2,921 


6.91±0.57 
7.38±0.51 


1.25±0.09 
1.35±0.07 


-0.19 
-0.68 


high He 


788 


7.50±0.98 


1.15±0.13 


-0.98 


low rie 


3,435 


5.51±0.45 


1.43±0.11 


-0.11 



Note. — All samples use 14.5 < r < 17.7, 0.05 < z < 0.12, and -21.6 < Mr < 
—20.2. So is in units of h^^ Mpc. <''sQ~f/y/o'agCT~f is the normalized correlation 
coefficient between so and 7. 



''definition criteria include [O I], [O I] /Ha 
''definition criteria exclude [O I], [O I] /Ha 
''the line luminosities are in units of erg s~ 



TABLE 3 

Object Sample Definition as a function of rie 



Sample N^bj [S II]A6716/A6731 ne(cm-=^) 

high ne 5,922 < 1.05 > 5 X 10^ 

low ((,■ 27.116 -> 1,32 < 1 X 10^ 
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TABLE 4 

Object Sample Definition as a function of Nh 



Sample 


Nobj 


Nh 


(E - B)v 


low Nh 


19,343 


< 10 X lO^o cm-2 


< 0.20 


high Nh 


8,573 


> 20 X 10^0 cm-2 


> 0.45 



TABLE 5 

Median Values in Absolute Magnitude Limited Samples 



Sample 


Nob, 


C 


u — r 


log L[oi] 


log ifOIIIJ 


Nh 




(1) 


(2) 


(3) 


(4) 


(5) 


(6) 


(7) 


(8) 


emission-line 


12,676 


2.61 


2.31 


38.80 


39.26 


13.44 


1.7 xlQi 


seyferts 


829 


2.82 


2.63 


38.93 


39.81 


11.65 


1.9 xlO^ 


liners 


3,263 


2.93 


2.73 


38.34 


38.65 


9.64 


1.2 xlO^ 


...pure LINERs 


2,424 


2.99 


2.79 


38.22 


38.48 


8.19 


1.2 xl02 


...transition 


839 


2.73 


2.53 


38.70 


39.15 


12.81 


1.0 xlO^ 


H IPs 


6,622 


2.38 


2.01 


38.90 


39.37 


15.92 


1.6 xlQi 



Note. — Col. (2): number of objects in the samples; Col. (3); concentration 
index; Col. (4): u — r color for the host galaxies; Cols. (5) and (6): logarithmic 
values of [O I] and [O III] luminosities in erg s^^; Col. (7): neutral hydrogen column 
density in units of 10'^'^ cm~-^; Col. (8): electron density in cm~^ 



TABLE 6 

Summary of Results ii Interpretation 



. \ Sample 

Properties \ 


H II 


Transition 


Seyferts 


Pure LINERs 


clustering 


weak 


weak 


weak 


strong 










-like galaxies 


fuelling rate 


average 


moderately low 


relatively high 


(very) low 


{L\o\], ■f'fOllll) 


{not low!) 




efficient ? 


inefficient? 


fuel available 


low 


low 


high 


moderately high 










(wide range) 


obscuration 


high 


low 


wide range 


low 


host- 


blue-ish 


red-ish 


red-ish 


red-ish 


-morphology 


late type 


earlier type 


earlier type 


earlier type 






-like Seyferts 




-like Seyferts 


Mbh 


small 


small 


small 


large 


life-time 


•? 


I 


short 


long 
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Fig. 1. — Volume-limited sample definition in redshift-absolutc magnitude space. The smooth curves delineate the SDSS main galaxy 
sample boundaries corresponding to our choice of apparent magnitude limits of 14.5 < rrir < 17.7. The rectangle shows the limits in 
absolute magnitude of the parent sample of galaxies used in this analysis. 
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l:q ;p I /Hr/; 

Fig. 2. — Emission-line diagnostic diagrams with separate density contours for the four subclasses of narrow emission line galaxies that 
have all 6 features (H«, H/3, [O III], [N II], [S II], and [O I]) detected in emission at 2(t significance. The transition objects are shown 
in red. Density contour lines correspond to factors of n of the total number of objects in each class, where n = 0.1,0.2,0.3,0.5,0.7,0.9 
starting from outermost contour. The con tours corresponding t o the Transition objects are overplotted in red, and show the lines with 
n = 0.1,0.3,0.6,0.9. The solid curve is the IKewlev et al. I 120011^ theoretical separation between accretion and star-forming systems, and 
the dotted lines are drawn at ±0.1 dex of this prediction. 
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Fig. 3. — Emission-line diagnostic diagrams showing how the AGN defined using definitions of Miller et al. {left panels, "2-line" AGN 
in red, and "4-line" AGN in magenta and multicolored density contours), and Kaufl^mann et al. (right panels, red + blue points) distribute 
as a function of [S II]/Ha and [O I]/Ha line-fiux ratios. For the latter case, the objects situated below the Kewley et al. curve in the 
[N II] /H« vs. [O IIIJ/H/3 diagram, which are most likely to include contamination from star-forming activity, are overplotted in blue in 
this diagram, and are shown in multicolored contours in the other two diagrams. The density contours presented in Figure |5]are shown in 
the background for comparison. 
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Fig. 4. — Redshift distributions for different subclasses of objects, in the absolute magnitude limited samples. The left panels corresponds 
to spectral classification that does not include conditions on [O I] and [O I] /Ha while the right panels show objects whose classification 
includes these parameters (note the lower number of the emission-line systems, especially at higher redshifts). The histograms are re- 
normalized to have the total number of objects equal to that of the galaxy sample. The solid curves show the corresponding 2nd order 
polynomial fits for subclasses of objects in the absolute magnitude limited samples, while the dotted line shows the polynomial fit for the 
volume limited galaxy sample. 
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Fig. 5. — Distributions in apparent and absolute r magnitude of the hosts of different spectroscopically-selected subclasses, all of which 
were drawn from a volume-limited parent galaxy sample. The histograms are re-normalized to have the total number of objects equal to 
that of the galaxy sample. 
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Fig. 6. — Distributions in log i[oi]i log i'[oill]i intrinsic column density Njf, and [S II] A6716/A6731 line ratio, for different subclasses 
of objects, in the absolute magnitude limited samples. In the latter plot, the shaded areas indicate corresponding values of rie that are 
beyond the minimum and maximum theoretical limits, max rie = 10^, min Ue = 10^ cm. Luminosities are in erg s~^. The histograms are 
re-normalized to have the total number of objects equal to that of the emission-line galaxy sample. 
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Fig. 7. — Upper panels: Comparison of the CFs estimated for the whole galaxy volume limited sample, for the objects that show activity 
in the form of emission lines, and for the "passive" (i.e., relatively normal) galaxies. Lower panels: Comparison of the CFs for different 
spectral classes of objects. The insets in the upper plots show the contours indicating the 68.3%, 90%, and 99% confidence regions for 
the passive and the actively line-emitting galaxies. The left panels correspond to spectral classification that excludes [O I], while the right 
panels show the results for sample definition that includes this feature; here, the CF of the emission-line galaxies shows lower signal-to-noise 
and the power-law fits have larger error contours. The passive galaxies are significantly more clustered than the "active" line-emitting 
galaxies, contours corresponding to the CF's of Scyfcrts, LINERs and H ll's are illustrated in Figure |S] 
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Fig. 8. — Confidence intervals of the power-law fit parameters, so and 7. Contours indicate the two-dimensional 68.3%, 90%, and 99% 
confidence regions for the CF of Seyferts, LINERs, and H II galaxies. The left panels show results for sample definitions that do not employ 
conditions on [O I], while for the right panels conditions on [O I] are used. Each plot shows, for comparison, the results for the volume 
limited galaxy sample; the axis range is the same in all plots. 
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Fig. 9. — Comparison of CF parameters so and 7 for ICQ random subsamples of the volume-limited galaxy sample with the CF for 
Seyferts whose definition includes [O I] . The left panel shows the results for subsamples of galaxies constructed by randomly selecting 1000 

objects per subsamplc, while the right panel shows the results for the case where the sources in the random subsamples are also constrained 
to follow the redsliift distribution of the Seyferts. Note that in both cases, not a single "mock Seyfert" sample has as low a value of so as 
Seyferts display, and that in both parameters the real Seyferts are clearly separated from the mock samples. 




Fig. 10. — Comparison of the CFs estimated for AGN and line emitting iioii-AGN classified using the definitions of Kauffmann et al. 
{left}, and Miller et al. (right). When such AGN samples are analysed, the LINER behavior is dominant, thus, their clustering properties 
are similar to those of the galaxies. 
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s( h"' Mpc) s( h"' Mpc) 

Fig. 11. — The CFs for object samples characterized by diflferent ranges of [O III] (left) and [O I] (right) emission-Une luminosities. 
The galaxy correlation function is also shown for comparison in both panels. Sources with lower I/[oiii] and low ^[01] show significantly 
stronger clustering. Comparisons of the contours corresponding to the 68.3%, 90%, and 99% confidence regions are indicated in the 
inner upper right insets in each plot. 
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Fig. 12. — left: CFs for object samples that are characterized by different electron densities (rie) as obtained from [S II] line ratios (see 
text for details), right: CFs for object samples with high and low levels of intrinsic absorption, Njj. The galaxy correlation function is 
also shown for comparison in each plot. Confidence limits for the power-law fits to the CF for the subsamples of low and high ne and Njj 
respectively are compared and shown in the upper right inset of each panel. 
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Fig. 13. — Distribution in the concentration index C (left), and the u — r color (right) for different subclasses of objects, in the absolute 
magnitude limited samples. The histograms are re-normalized as in Figure |H] Note the similarity between the distributions in these 
parameters for the Seyfert and LINER samples. 
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Fig. 14. — Relationships between the clustering amplitude sq and the median values of the concentration index C (left), and the u — r 
color (right), for the corresponding samples of different emission-line properties. 
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Fig. 15. — left: CFs corresponding to Pure LINERs and Transition objects: the transition systems are significantly less clustered than 
the Pure LINERs. right: Comparison of distributions in C and u — r for these two subsets of LINERs; the histograms are re-normalized 
to have the total number of objects equal to that of the whole LINER sample. Note the slight shift toward bluer and later type hosts for 
the Transition objects. 




Fig. 16. — Comparison of distributions in L[oiii], ^[oi]; ^H, and Ue for Transition objects and Pure LINERs. The histograms are 
re-normalizcd to have the total number of objects equal to that of the whole LINER sample. Transition object properties show a slight 
tendency toward a more H ii-like behavior. 



